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Abstract — A belief-propagation decoder for low-density lattice 
codes is given which represents messages explicitly as a mixture 
of Gaussians functions. The key component is an algorithm 
for approximating a mixture of several Gaussians with another 
mixture with a smaller number of Gaussians. This Gaussian 
mixture reduction algorithm iteratively reduces the number 
of Gaussians by minimizing the distance between the original 
mixture and an approximation with one fewer Gaussians. 

Error rates and noise thresholds of this decoder are compared 
with those for the previously-proposed decoder which discretely 
quantizes the messages. The error rates are indistinguishable for 
dimension 1000 and 10000 lattices, and the Gaussian-mixture 
decoder has a 0.2 dB loss for dimension 100 lattices. The 
Gaussian-mixture decoder has a loss of about 0.03 dB in the noise 
threshold, which is evaluated via Monte Carlo density evolution. 
Further, the Gaussian-mixture decoder uses far less storage for 
the messages. 

I. Introduction 

Low-density lattice codes (LDLC) are lattices characterized 
by a sparse inverse generator matrix, making them suitable for 
belief-propagation decoding. Sommer, Feder and Shalvi pro- 
posed this lattice construction, described its belief-propagation 
decoder, and gave extensive convergence analysis. With decod- 
ing complexity which is linear in the dimension, it is possible 
to decode LDLC lattices with dimension of 10 6 . When used 
on the unconstrained-power AWGN channel, a noise threshold 
appeared within 0.6 dB of an asymptotic limit [1]. 

A notable aspect of belief-propagation decoding of LDLC 
codes on AWGN channels is that the messages are mixtures 
of Gaussian functions. This is appealing for a decoder imple- 
mentation, but the number of Gaussians in the mixture grows 
doubly exponentially in the number of iterations, and a naive 
implementation using Gaussians is infeasible. Thus, in prior 
work by Sommer et al., the decoder messages were quantized. 

This short paper describes a belief-propagation decoder 
for LDLC lattices which uses messages which are Gaussian 
mixture distributions. That is, each message consists of means, 
variances and mixing coefficients. The key part of this decoder 
is a Gaussian mixture reduction algorithm which approximates 
a Gaussian mixture distribution with several Gaussian compo- 
nents by another mixture with fewer such components. This 
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algorithm stops the doubly exponential growth in the number 
of Gaussians. 

This Gaussian mixture reduction algorithm compares the 
distance between all possible pairs of Gaussians in an input 
list, and replaces the closest pair with a single Gaussian. 
This proceeds iteratively until a stopping condition is reached. 
While the Kullback-Leibler (KL) divergence would be an 
appropriate distance measure, it does not have a closed form 
for Gaussian mixtures, and therefore it is not suitable for use in 
an efficient decoding algorithm. Instead, the squared distance 
is used, which can be computed in closed form. The special 
case of replacing two Gaussians with a single Gaussian, and 



the square distanced, is first described in Section II-A Then, 
the Gaussian mixture reduction algorithm, which is the general 
case with an arbitrary number of Gaussians, is described in 
Section IlLBl 

A related technique is the iterative pairwise replacement 
algorithm for reducing the order of a Gaussian mixture in 
kernel density estimation [2]. For compressed sensing, this al- 
gorithm was applied to find the solution to an underdetermined 
system of equations, when there is an a priori distribution 
on the unknowns [3]. This inference algorithm has some 
similarities with LDLC decoding. In parallel to our work, 
alternative methods to reduce the order of a Gaussian mixture 
have recently been proposed for use in statistical learning 
applications [4]. 

LDLC lattice construction and the unconstrained-power 



communication system is reviewed in Section III-A Then, 



Section III-B presents the LDLC decoder which represents 
messages using the means, variance and mixing coefficients 
of the component Gaussians. Gaussian mixtures were used 
by Sommer et al. to analyze decoder convergence; the main 
novelty here is to show how the Gaussian mixture reduction 
algorithm is incorporated into such a decoder. 

The error-rate performance of the Gaussian-mixture decoder 
is either indistinguishable from, or close to, that of the 
quantized-message decoder, as is numerically demonstrated in 
Section ITV-AI The error rates of the Gaussian-mixture decoder 
and the quantized-message decoder are indistinguishable for 
dimension 1000 and 10000 lattices, and there is a loss of about 
0.2 dB for dimension 100 lattices. Additionally, to place the 
error rates of LDLC lattices in context, comparisons are made 
with a universal lattice sphere bound. Dimension 100 LDLC 
lattices have a substantial gap with this bound, but this gaps 
decreases as the dimension increases. 

Noise thresholds for LDLC codes, found by Monte Carlo 
density evolution, are used to evaluate the complexity- 



performance tradeoff in Section IV-B In the high-complexity 
case, the noise thresholds for the Gaussian-mixture decoder 
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lose 0.03 dB with respect to the quantized-message decoder. 
On the other hand, complexity can be substantially reduced 
while decreasing the loss to 0.1 dB. In practice, the number 
of Gaussians in the mixtures was small, and so the Gaussian- 
mixture decoder requires far less storage than the quantized- 
message decoder. Discussion is given in Section [V] 

The following notation is used. A Gaussian distribution with 
mean m and variance v is denoted as: 

N(z\m,v) = e 2 " . (1) 



A message f(z) is a mixture of N Gaussians: 



(2) 



where Cj > are mixing coefficients with Xa=i c * = !• 
An equivalent representation of f(z) is a list of N triples 
of means, variances and mixing coefficients, {ti, . . . ,<at} = 
{(mi, vi, ci), . . . , (mjv, Vjf, c/v)}, and these two representa- 
tions will be used interchangeably. 

II. Gaussian Mixture Reduction Algorithm 

A. Approximating Two Gaussians with One Gaussian 

When approximating a true distribution p(z), by an approxi- 
mate distribution q(z), a reasonable approach is to choose q(z) 
in such a way that the KL divergence, KL(p| \q), is minimized. 
For distributions with support Z, the KL divergence is given 
by: 

p{z 



KL(p||q) 



pO) lo e ^-fr dz - 



(3) 



When q(z) is a Gaussian, selecting the mean and variance to 
be the same as those of p(z) will minimize the divergence; 
this is sometimes known as "moment matching." In particular, 
if p(z) is a mixture of two Gaussians we have the following. 

Lemma 1: The single Gaussian with mean m and variance 
v which minimizes the divergence from the mixture of two 
Gaussians t\ — (nii,«i,Ci) and t 2 = (m 2 ,v 2 , c 2 ), with C\ + 
c 2 = 1, is given by: 



m 

v 



c\rti\ + c 2 m 2 , and 



c±(m\ 



V\) + c 2 (m 2 - 
- 2c\c 2 m,\m 2 



v 2 ) 



(4) 



(5) 



The single Gaussian which satisfies the property of Lemma [T] 
is denoted as: 

t = MM(t 1 ,t 2 ), (6) 

where t — (m, v, 1), with m and v as given in Q and ([5|. 

While it is easy to find the mean and variance of the single 
Gaussian q(z) which minimizes the divergence, the minimum 
divergence itself does not appear to have a closed form. 
Computing the divergence numerically is complex. Instead, 
the squared difference is computed. 

Definition The squared difference SD(p||g) between two 
distributions p(z) and q(z) with support Z is defined as: 



SD(p\\q) = 



(pO) - q(z)) 2 dz. 



(7) 



f(z)-g(z) 




Fig. 1 . Seven Gaussian components (thick dashed line) approximated by three 
Gaussian components (thick solid line), using Gaussian mixture reduction 
algorithm with 8 = 0.01. The Gaussian mixture reduction output g(z) (thick 
dash-dot line) is a good approximation of the input f(z) (thin line). 



The squared difference is non-negative and zero if and only 
if p — q. The squared difference is symmetric. 

Computing the squared distance between a mixture of two 



Gaussians and a single Gaussian is tractable. In Section II-B 



it will be convenient to have a distance measure, or penalty, 
associated with replacing a two-component Gaussian mixture 
by a single Gaussian. Define the Gaussian quadratic loss 
GQL(p) as the squared difference between a distribution p and 
the Gaussian distribution with the same mean m and variance 
v as p: 

GQL(p) = SD(p|| Af(m, «)). (8) 

Lemma 2: For the distribution which is the mixture of two 
Gaussian functions t% = (toi,ui,ci) and t 2 = (m 2 ,v 2 ,c 2 ) 
with ci + c 2 = 1, the Gaussian quadratic loss is given by: 




y /2ir(v + v 1 ) 

2cic 2 
y / 2Tr(v 1 + v 2 ) 



2(» + «i) 



2C«l+«2) 



2c 2 



_g 2(v + v 2 ) 



v 2 ) 



(9) 



where m and v are given in |4]) and Q, respectively. 

When the mixture is not normalized, that is c\ + c 2 ^ 1, 
the GQL is computed by first normalizing the two mixing 
coefficients to sum to one. 



B. Approximating N Gaussians with M Gaussians 

This section describes the Gaussian reduction algorithm 
which approximates a given mixture of N Gaussian functions 
by a mixture of M < N Gaussian functions. Specifically, the 
algorithm input f(z) is the list of triples ti, i = 1, . . . , N, and 
two stopping parameters 8 and M max . The algorithm output 
g(z) is a mixture of M Gaussian distributions, represented 
as a list of M triples. The algorithm is denoted as g(z) = 
GMR(/(z)). 
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The algorithm recursively combines Gaussians in a greedy 
pairwise fashion until two stopping conditions are fulfilled. At 
each recursion step, the pair of Gaussian distributions with 
the smallest GQL are eliminated, and are replaced by the 
single Gaussian with the same moments. In this way, the GQL 
is a one-step error metric. The recursive procedure is halted 
when both of two conditions are satisfied. First, the GQL of 
all remaining pairs of Gaussians is greater than a combining 
limit 8. Second, the number M of Gaussian components in the 
current mixture is less than or equal to Af max . Note that even 
if the GQL of all remaining pairs of Gaussians is larger than 
9, the algorithms continues replacing until M < A/ max . In 
other words, we wish to avoid mixtures with many Gaussian 
components, even if this implies a large one-step GQL. On the 
other hand, the number of output Gaussians may be less than 
M max , if the one-step GQL is sufficiently low. An example 
of the input f(z) and output g(z) is shown in Fig. [I] The 
algorithm is as follows. 
Gaussian Mixture Reduction Algorithm 

1) Input: 

a) List C = {ti,t2, ■ ■ ■ ,tN} of N triples describing 
a Gaussian mixture. 

b) Two stopping parameters, 6 and Af max . 

2) Initialize: 

a) The current search list, C, is the input list: C «— C. 

b) The length of current list, M c <— N. 

c) The minimum GQL between all pairs of Gaussians, 



9 C <- min GQL(fc,t,). 

3) Greedy combining. While 6 C < 9 or M c > M max : 

a) Determine the pair of Gaussians (ti,tj) with the 
smallest GQL: 

(ti,tj) «- arg min GQL(i l; t,). 

ti,tj<EC,i^j 

b) Add the single Gaussian with the same moment as 
U and tj to the list: 

C *-C\JMM(ti,tj). 

c) Delete and tj from list: C <— C\ {ti,tj}. 

d) Decrement the current list length: Af c <— M c — 1. 

e) Recalculate the minimum GQL: 

9 C <- min GQLftj.iA 

4) Output: list of M triples given by C. 

This is a greedy algorithm, and no claims are made about its 
global optimality. However, as demonstrated in the following 
sections, it has good performance when decoding LDLC 
lattices. 

The primary complexity is proportional to N 2 , due to the 
computation of the initial GQL between N pairs at step 2- 
c. Subsequent steps only compute the GQL between the new 
Gaussian and the remaining Gaussians. 



III. LDLC Lattices and Decoder 

A. LDLC Lattices and Unconstrained Power System 

An n-dimensional lattice A is defined by an n-by-n gener- 
ator matrix G. The lattice consists of the discrete set of points 
x = (xi,X2, ■ ■ ■ , x n ) for which 



x = Gb, 



(10) 



where b = (bi, . . . , b n ) is from the set of all possible integer 
vectors, 6, e Z. Lattices are a linear subspace of the n- 
dimensional real space K". 

An LDLC lattice is a lattice with a non-singular generator 
matrix G, for which H = G _1 is sparse and random. It is 
convenient to assume that H has been normalized so that 
| det(ff)| = 1/| det(G)| = 1. A Latin square LDLC code has 
an H matrix with constant row and column weight d, where 
the non-zero coefficients in each row and each column have 
the values hi, hz, . . . , hd with hi > > • • • > hd > 0. The 
sign of each element of H is randomly made negative with 
probability one-half [1]. In the numerical results section, Latin 
square LDLC codes with hi = 1 and hi = 1 /V~d, i = 2, . . . , d, 
are used. 

The unconstrained-power AWGN system, as was considered 
by Sommer et al., is also used here. An arbitrary element x 
of the lattice is transmitted over an AWGN channel. Noise 
Wi with variance a 2 is added to each symbol. The received 
symbols are = Xi + Wi for i — 1,2, ... ,n. A maximum- 
likelihood decoder estimates the transmitted lattice point x as 



x = arg max Pr (y |x j 



(11) 



Since the transmit power is unbounded, the system is 
constrained by the lattice density, measured by the volume 
of the lattice's Voronoi region, V\ — |det(G)|. Somewhat 
analogous to the Shannon limit, there exist lattices such that 
the probability of error, Pr(x ^ x) becomes arbitrarily small, 
if and only if, 

r 2/n 



(7 



vt 



2ire ' 



(12) 



provided the lattice dimension is allowed to grow without 
bound [5] [6]. This system is linear, so for finding error 
probabilities, it is sufficient to consider the transmission of 
the all-zeros lattice point. 

B. LDLC Decoder Using Gaussian Mixtures 

The LDLC belief-propagation decoding algorithm may be 
adapted to incorporate the Gaussian mixture algorithm. The 
check node and variable node functions are decomposed 
into forward-backward recursions, and the Gaussian mixture 
reduction algorithm is applied after each recursion step. 

The decoding algorithm may be presented on a bipartite 
graph with n variable nodes and n check nodes. For Latin 
square LDLC codes, there are nd variable-to-check mes- 
sages /ife(z), and nd check-to-variable messages pk(z), k = 
1,2,..., nd. Associated with variable node i is the channel 
output yi, and channel message is = J\f(z; yi, tr 2 ) (yi is 
distinct from yi(z)). The initial variable-to-check message for 
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Pi(z) Pd-i(z) Pd(z) 

(a) 

Fig. 2. Messages for check node (a) and variable node (b) decoding functions. 



edge k is p k (z) = yi(z), if edge k is connected to variable 
node i. 

Check node Corresponding to each edge k = 1 , . . . , d 
connected to a check node, the input messages are p k (z), the 
output messages are p k (z), and the edge labels from matrix 
H are h k , as shown in Fig. |2]-(a). The check node output 
along some edge is the convolution of the incoming messages 
on all other edges, followed by a shift- and-repe at (periodic 
extension) operation. For example, for edge d, the convolution 
before shift-and-repeat pd{z) is: 

Pd(z) = Mi(t-) *"-*IJ>d-i(r ) (13) 

lii rid-i 

An unknown integer b is from the set B; in the general case B 
is the set of all integers. The shift-and-repeated message Pk(z) 
along edge k is : 



Pk{z) = 



b£B 



p k (b- h k z), 



(14) 



which has a period of if B is the set of integers. 



The check node convolution ( 13 1 can be implemented by a 



forward-backward recursion as follows. First, denote a single 
convolution as: 



a{z)*n(-). 



(15) 



Then, the forward recursion is initialized with an, = 
{(0,0,1)}, in triples representation. The forward recursion 
step is convolution followed by Gaussian mixture reduction, 



atk(z) = C(a fe _i(z),^ fe (^-)) 



a k (z) = GMR(5 fe (z)), 



(16) 
(17) 



for k = 1, . . . , d — 1. Backward messages (3 k (z) and (3 k (z) 
are found in an analogous fashion, initialized with fid = 
{(0, 0, 1)}, with the recursion, 

k (z) = C((3 k+ i(z),n k+1 (z)), (18) 
I3 k (z) = GMR(ft(z)), (19) 

for jfe = d-l,...,2,l. 

The convolution, C(a(z), n(z/h)\ in triples representation 
Ume,Vg,cg)\, can be computed as follows. Let a(z) be a 
mixture of / Gaussians, and let p(z) be a mixture of J 
Gaussians, represented as: 

{«,tM)}' =i and {K,«;,c^)} J =i , (20) 



respectively. Then, the convolution is a mixture of I ■ J 
Gaussians, given by: 



me 



h 2 



3 



(21) 
(22) 
(23) 



where each component i E {1,...,/} of a(z) and each 
component j G {1,..., J} of is pair-wise convolved 
to produce an output component £ E {1, . . . , I ■ J}. 

The forward-backward recursions are completed by com- 
puting the message p k (z) as: 



p k (z) = GMR(an(z)+A( 2 )). 



(24) 



The check node output p k (z) is found by applying the shift- 
and-repeat operation ( fl4| , which increases the number of 
Gaussians by a factor of \B\, where \B\ is the cardinality of 
the set B. 

Variable Node Corresponding to each edge k = 1 , . . . , d 
connected to a variable node, the inputs are messages p k (z) 
and the outputs are messages fi k (z); the channel message is 
y(z), as shown in Fig. |2j-(b). The variable node output along 
some edge is the product of the incoming messages on all 
other edges, including the channel message. For example, the 
output for edge d, pd(z), is: 



y(z) ■ pi{z) ■ Pd-i(z) 



(25) 



The variable node multiplication ( 25 1 can be implemented 
by a forward-backward recursion as follows, and is illustrated 
in Fig. [3] First, denote a single multiplication as: 



V(a{z),p{z)) = a(z)-p(z). 



(26) 



Then, the forward recursion is initialized with ciq(z) — \J y(z). 
The square root of a Gaussian distribution is another Gaussian 
distribution with the same mean and twice the variance, so in 
triples representation the initialization is {(y, 2a 2 , 1)}. Then 
the forward recursion for k = 1 , 2 . . . , d is, 

a k (z) = V(a k -i(z),p k (z)), and (27) 
a k (z) = GMR(5 fc (z)). (28) 

The backward recursion similarly computes messages (3 k , and 
is initialized with /3^(z) = y/y(z), and the recursion is: 

k (z) = V(/9 fc _i(«) s p fc («)) and (29) 
k (z) = GMR(ft(z)). (30) 

The product V(a(z), p(z)j, in triples representation 
{ (me, ve, ce)}, can be computed as follows. Let a(z) be a 
mixture of / Gaussians, and let p(z) be a mixture of J 
Gaussians, represented as: 



{(mf,< )C f)}' =i and { K, } \ 



(31) 
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Vv pi 



a i 



Pk | 



0: 2 




P2 




Ctk-1 



Pk P 



d-2 



Pd-l 



pd Vv 



Fig. 3. Forward-backward variable node decoding, with Gaussian mixture reduction applied after each step. 



respectively. Then, the product is a mixture of I ■ J Gaussians, 

5, (32) 
■4, (33) 



iven by: 




1 


1 




«? 


mi 


mf 


ve 




ce = 





27r(«f + <) 



exp 



TO" 



to]) 2 



(34) 



where each component i E {1,...,/} of a(z) and each 
component j € {1, . . . , J} of p(z) is pair-wise multiplied to 
produce an output component £ £ {1, . . . , I ■ J}. 

The forward-backward recursions are completed by com- 
puting the output message /ifc(z) as: 



H h {z) = GMR(a fc _i(z) -f3 k (z)). 



(35) 



The incoming messages pk(z) have a large number of Gaus- 
sian components, created by the shift-and-repeat operation. 
However, by initializing the recursion with a single Gaussian, 
many components away from this Gaussian have near-zero 
mixing coefficients, and are eliminated or combined by the 
Gaussian mixture reduction algorithm. This effect persists 
as the forward-backward recursions proceed, and in both 
directions. This would not occur in the backward recursion 
using the more naive initialization of ao(z) = y(z) and 
(3d(z) = 1. Note that the output message pk(z) is the product 
ak-i(z) ■ Pk{z)\ since both otk~i{z) and (ik{z) include the 
factor y/ y(z), Pk(z) correctly includes the factor y(z). 

Hard Decisions Hard decisions on the integers, b, may be 
found by first making decisions x = (xi, . .. ,x n ), as, 

d 

Xi = argmaxy(z) p k (z), (36) 

k = l 

where y(z) and p k (z) are the channel message and input 
messages, respectively, for variable node i. Practically, this 
product may be found as ad(z)y / y(z). Then, b may be found 
by rounding each element of the vector i/x to the nearest 
integer. A "word error" occurs if, after a sufficiently large 
number of iterations, b ^ b. Note that x is not necessarily a 
lattice point. 

IV. Numerical Results 

A. Finite-dimensional Lattices 

For finite-dimensional lattices, the Gaussian mixture de- 
coder has error rates similar to the quantized decoder, as 



Mixture Decoder. (1=2 
Mixture Decoder. (1=0.01 
Quantized decoder 
Lattice sphere bound 




n=100 



2 3 4 

SNR (dB) 



Fig. 4. Probability of word error for Gaussian mixture decoder and quantized- 
message decoder. 



shown in Fig. [4] For a dimension 100 lattice with d = 5, 
the performance loss was no greater than 0.2 dB. For a 
dimension 1000 and 10000 lattice both with d — 7, there 
was no discernible performance loss at high signal-to-noise 
ratios when 9 = 0.01. Fig. [4] shows the word error probability 
Pr(h ^ b), rather than error rates for the symbols 6j. The 
SNR is vl /n /2 nea , and for all lattices H is normalized such 
that Va = 1. When the channel noise variance a 2 is equal to 



the capacity, that is, it satisfies eqn ( 12 1 with equality, the SNR 
is dB. 

Instabilities in the decoding algorithm can occur, but these 
may be avoided by setting a minimum message variance. As 
iterations progress, Gaussian variances will decrease towards 
narrow peaks. But due to quantization effects, two narrow 
peaks may not align and the Gaussian mixture reduction 
algorithm will fail to combine them. This was avoided by 
setting a minimum variance. A message variance v is replaced 
by max(w,7), where 7 is a constant. Values in the range 
7 = 10~ 3 to 10~ 4 were used. The quantized-message decoder 
uses a distinct method to avoid instabilities [1, p. 1573]. 

Tarokh, Vardy and Zeger gave a lattice sphere bound, which 
is a lower bound on the probability of word error for the 
unconstrained-power system [5, Theorem 2.2]. This bound was 
shown to be reasonably tight for n = 16, and is expected to 
further improve for increasing dimension. This lattice sphere 
bound is also shown in Fig. |4] For n = 100 LDLC lattices, 
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there is a substantial gap of about 2.5 dB, suggesting that 
there is room for improvement in LDLC lattice design, belief 
propagation decoding, or both. However, this gap decreases as 
the lattice dimension increases. 



B. Complexity-Performance Tradeoff 

The complexity-performance tradeoff for decoding is ob- 
tained through the combining limit 8. Decreasing 8 increases 
the complexity and improves performance. If 8 is high, 
mixtures are allowed to contain many components which 
accurately represent the message and fewer decoding errors 
are made, however, the computational complexity is then 
substantial. On the other hand, lower 8 will lead to decoders 
with lower complexity but with higher error probability. The 
maximum number of allowed Gaussians A/ max was set to a 
large number, for example M max = 1000, without an obvious 
increase in complexity. 

Noise thresholds are used to characterize the performance 
of LDLC lattices. The noise threshold is the lowest SNR for 
which density evolution of an asymptotically large dimension 
lattice converges. For binary low-density parity-check codes on 
the AWGN channel, density evolution can be used, because the 
decoder messages are scalars, and the density (or distribution), 
can be discretized [7]. However for non-binary low-density 
parity-check codes, such as those constructed over finite fields, 
the messages are vectors. True density evolution would use a 
discretized joint density over a vector, which is impractical. 
Nonetheless, although less inefficient, noise thresholds for 
non-binary low-density parity-check codes may be found using 
Monte Carlo density evolution [8]. 

The situation for the Gaussian-mixture LDLC decoder is 
similar to non-binary low-density parity-check codes, because 
the decoder messages consist of multiple means, variances 
and mixing coefficients. Accordingly, LDLC lattice noise 
thresholds are found by Monte Carlo density evolution, as 
follows. At each half iteration samples for iV s = 10 6 nodes 
were randomly drawn from an input pool, and then placed in 
an output pool. The pool has d types of messages to distinguish 
the edges with distinct coefficients hi to hd- The output 
pool becomes the input pool for the next half iteration, and 
this procedure repeats until convergence was detected. Each 
variable node produces an output Xi ^i ven by (36) , and the 
mean-squared error is computed as J^. 5 xf/N s , since the all- 
zeros lattice point is assumed. Convergence is declared when 
the mean-squared error approaches zero. Interest is restricted 
to LDLC lattices with hi = 1 and /12 = • • • = hd = 1 /Vd. 
As such, power is suitably normalized, since such lattices have 
l/|detif| = Va — > 1 as the dimension becomes large. 

The complexity of the decoding algorithm is proportional 
to M 4 , where M is the number of Gaussians in the input 
mixtures. In the forward-backward recursions, the convolution 
or product of two inputs with M Gaussians each produces 
a mixture with M 2 Gaussians, for example eqns 



I81, (29 1 



This mixture of M 2 Gaussians is the input to the Gaussian 
mixture reduction algorithm. Since the complexity of Gaussian 
mixture reduction algorithm is proportional to the square of 
the number of inputs, the net complexity is M 4 . However, M 
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Fig. 5. Performance-complexity tradeoff, parameterized by Gaussian com- 
bining parameter 0; d = 7, A/ max = 1000,B = {-1,0, +1}. Performance 
measure is noise threshold, found by Monte Carlo density evolution. The 
complexity B[Af 4 ] (left-hand axis) is a reasonable predictor of computer 
time (per iteration, right-hand axis); the two y-axes have different zeros due 
to computer overhead. The inset graphs show representative distributions on 
M for the message p(z) associated with edge hi, on iteration 5, for two 
values of 6 = 0.01,0.08. 



is usually much less than M max , and the distribution of M 
indirectly depends on the combining limit 8, channel noise 
a 2 , the iteration number and the code design. 

Accordingly, the expected value of M , E[M 4 ], is used 
to characterize complexity. During Monte Carlo density evo- 
lution, the distribution on the number of Gaussians in the 
message was collected (see Fig. [5] insets), and used to find 
E[M 4 ]. Many messages fik(z) contained a few Gaussian 
components. Likewise, the number of Gaussian components of 
Pk(z) was a small integral multiple of \B\. Because these latter 
messages contain more Gaussians, complexity is dominated by 
the variable node. 

The performance-complexity tradeoff is illustrated in Fig. [5] 
For small values of the combining limit 8 = 0.01, Gaussian- 
mixture decoding has a noise threshold within 0.03 dB of 
the quantized decoder. On the other hand, choosing 8 = 0.1 
increases the noise threshold loss to 0.1 dB, while substantially 
reducing the complexity; further increase in 8 results in more 
performance loss with little complexity improvement. A noise 
threshold of 0.6 dB was found for the quantized-message 
decoder, which corresponds to the symbol-error rate cliff of a 
dimension 10 5 lattice simulated by Sommer et al. 

The complexity of the quantized algorithm of Sommer et al. 
is dominated by a discrete Fourier transform at the check 
node, and thus it is difficult to make complexity comparisons. 
However, the memory required for the Gaussian mixture- 
based LDLC decoder is significantly superior, storing 3M 
values (for the mean, variance and mixing coefficient), for 
each message. On the other hand, the quantized algorithm used 
1024 quantization points for each message. 

V. Discussion 

Codes based upon lattices can achieve the capacity of 
the power-constrained AWGN channel, as has been shown 
by Loeliger [9], Erez and Zamir [10], and others. In fact, 
high-dimension lattices are good in a number of information- 
theoretic contexts [11]. However, for most lattices other than 
LDLC lattices, the decoder complexity is worse than linear 



in the dimension, and so decoding high-dimension lattices to 
approach capacity is infeasible. 

Sommer et al. demonstrated that decoding high-dimension 
lattices is possible by using a linear-complexity belief propaga- 
tion algorithm. Moreover, the error-rate performance of LDLC 
lattices comes with 0.6 dB of the capacity of the unconstrained 
power channel. The unconstrained-power channel is of sig- 
nificant theoretical interest [6], but further study of LDLC 
lattices is needed before they can be applied to more practical 
problems such as the power-constrained AWGN channel. 

The quantized-message decoder used to demonstrate the 
capacity-approaching performance ignores the underlying 
Gaussian nature of the messages. This paper demonstrated that 
it is possible to perform belief -propagation decoding of LDLC 
lattices using a mixture of Gaussian functions as the decoder 
message. The key part was the Gaussian mixture reduction 
algorithm, which was proposed to approximate a mixture of 
a large number of Gaussians with a smaller number of such 
Gaussians. Although it is an approximation, numerical studies 
showed this method to have little or no performance loss. The 
representation of messages as a Gaussian mixture uses far 
fewer parameters than the quantized-message decoder, which 
improves the efficiency of the decoder. In addition, belief- 
propagation decoding using a Gaussian mixture characterized 
by a small number of parameters should advance further study 
of LDLC lattices. 
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